SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: SAGAWA, HIROAKI 
UENO, HARUMI 
OSHIMA, ATSUSHI 
KATO, IKUNOSHIN 

(ii) TITLE OF INVENTION: PLASMID 

(iii) NUMBER OF SEQUENCES: 33 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: BIRCH, STEWART, KOLASCH & BIRCH, LLP

(B) STREET: PO BOX 747 

(C) CITY: FALLS CHURCH 

(D) STATE: VA 

(E) COUNTRY: USA 

(F) ZIP: 22040-0747

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: WEINER, MARC S. 

(B) REGISTRATION NUMBER: 32,181 

(C) REFERENCE/DOCKET NUMBER: 1422-0319P 

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: 703-205-8000 

(B) TELEFAX: 703-205-8050 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "2=Val or Leu" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Met Xaa Pro Leu Asp Lys Asp Leu Gln Lys Ala Lys Ile Ser Ile Thr
1 5 10 15

Asp Phe Phe Glu Ile Thr Asn Arg Val Leu Asp Tyr Phe Pro Asn Val
20 25 30

Ile Asn Asn Thr Val Glu Lys Gly Asp Tyr Leu Ile Ser Ser Ser Asn
35 40 45

Ile Ala Gly Thr Ile Lys Phe Leu Arg Pro Ile Asn Arg Lys Leu Phe
50 55 60

Ile Gln Glu Lys Lys Val Phe Asn Asp Tyr Phe Gln Lys Leu Ile Ile
65 70 75 80

Val Phe Glu Asn Ile Arg Asn Lys Lys Thr Val Thr Glu Glu Asp Lys
85 90 95

Ile Ile Ile Asp Arg Val Ile Tyr Thr Ile Gln Gln Ser Ile Gly Ile
100 105 110

Gly Leu Asp Leu Met Val Asn Gln Asn Ser Ala Arg Lys His Val Gly
115 120 125

Asn Arg Phe Glu Glu Leu Ile Arg Val Ile Phe Thr Glu Ile Ser Val
130 135 140

Ser Asn Lys Arg Thr Val Leu Gln Ile Pro Tyr Glu Thr Asp Glu Gly
145 150 155 160

Gln Lys Ile Tyr Lys Cys Glu Asn Asp Leu Ile Ile Ser Pro Phe Glu
165 170 175

Asn Val Glu Ser Thr Asn Lys His Leu Asp Glu Asn Glu Ile Val Val
180 185 190

Ser Ile Lys Thr Thr Ser Lys Asp Arg Met Gly Lys Met Phe Ile Asp
195 200 205

Lys Ile Leu Leu Glu Arg Phe Val Lys His Pro Gln Lys Val Ile Gly
210 215 220

Ile Phe Leu Asn Asp Val Gln Arg Lys Glu Asp Asn Asn Ile Ser Phe
225 230 235 240

Thr Leu Val Ser Gly Leu Phe Met Val Tyr Thr Lys Phe Leu Thr Thr
245 250 255

Leu Glu Gly Ile Tyr Tyr Leu Asp Pro Pro Pro Asn Ala Leu Lys Leu
260 265 270

Pro Tyr Ser Asn His Met Lys Arg Phe Ser Asp Leu Ile Thr Glu Asp
275 280 285

Leu Glu Lys Leu Phe Ser Ser
290 295

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 885 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

ATGSTACCAC TGGATAAAGA TTTACAAAAA GCAAAGATTT CAATTACTGA TTTTTTTGAA 60

ATTACAAATA GAGTTTTAGA TTATTTCCCC AATGTAATCA ATAATACAGT TGAAAAAGGA 120

GATTATTTAA TATCCTCATC AAATATTGCT GGAACAATAA AATTCCTAAG ACCAATCAAT 180

AGAAAGTTAT TTATTCAGGA AAAAAAAGTT TTCAATGATT ATTTTCAAAA ACTGATTATA 240

GTTTTTGAAA ATATAAGGAA CAAAAAAACT GTAACAGAGG AAGATAAAAT TATTATTGAT 300

AGGGTAATTT ACACAATACA GCAATCTATT GGAATTGGTT TAGATTTAAT GGTTAATCAA 360

AATAGTGCTA GAAAGCACGT TGGTAACCGA TTTGAAGAAT TAATTAGAGT CATTTTTACA 420

GAAATATCAG TATCGAATAA AAGAACTGTA TTACAAATTC CATATGAAAC TGATGAAGGA 480

CAGAAAATTT ACAAATGCGA GAATGACCTC ATTATTTCTC CTTTTGAAAA TGTAGAATCT 540

ACAAACAAAC ATCTAGATGA AAATGAGATT GTTGTTTCAA TAAAGACAAC ATCAAAAGAT 600

AGGATGGGAA AAATGTTTAT AGATAAAATT TTACTTGAAA GGTTTGTTAA ACACCCTCAA 660

AAAGTTATAG GGATTTTCCT CAATGATGTA CAAAGAAAAG AAGACAACAA TATCAGCTTT 720

ACACTTGTTT CAGGATTATT TATGGTGTAT ACTAAATTCT TAACTACTCT TGAAGGGATC 780

TATTATTTAG ATCCACCACC TAATGCATTG AAACTACCAT ATTCTAATCA TATGAAAAGA 840

TTTTCAGATT TAATTACAGA AGACCTTGAA AAATTATTCT CCTCT 885

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 215 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
TATGGATATG TTCATAAACA CGCATGTAGG CAGATAGATC TTTGGTTGTG AATCGCAACC 60
AGTGGCCTTA TGGCAGGAGC CGCGGATCAC CTACCATCCC TAATGACCTG CAGGCATGCA 120
AGCTTGCATG CCTGCAGGTC ATTAGGTACG GCAGGTGTGC TCGAGGCGAA GGAGTGCCTG 180
CATGCGTTTC TCCTTGGCTT TTTTCCTCTG GGACA 215

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 215 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

TATGTCCCAG AGGAAAAAAG CCAAGGAGAA ACGCATGCAG GCACTCCTTC GCCTCGAGCA 60 

CACCTGCCGT ACCTAATGAC CTGCAGGCAT GCAAGCTTGC ATGCCTGCAG GTCATTAGGG 120

ATGGTAGGTG ATCCGCGGCT CCTGCCATAA GGCCACTGGT TGCGATTCAC AACCAAAGAT 180

CTATCTGCCT ACATGCGTGT TTATGAACAT ATCCA 215 
(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
AGATCTAGAG CAAACAAAAA AACCACCG 
(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
GGTCTAGATC CCAGAGGAAA AAAG 24

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
CTCGAGATTT AGGTGACACT ATAGAATACG GAATTGTGAG CGGATAACAA TTCCAAGCTT 60
CACAGGAAAC AGACCATGGC TTAAGTAACT AGTGAATTCG 100 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
CGAATTCACT AGTTACTTAA GCCATGGTCT GTTTCCTGTG AAGCTTGGAA TTGTTATCCG 
CTCACAATTC CGTATTCTAT AGTGTCACCT AAATCTCGAG 
(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
AATCCCATGG AACGCTACGA ATCTCTG

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CCGGCCATGG TTATTTTTGA CACCAGACC 
(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
TAACTTGAAT CCATGGGTTC TCACCG 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
TACTCAGTAG CCATGGCTCT CATAGACCG 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Met Asn Glu Ile Ala Phe Asp Asn Tyr Ser Tyr Ile Pro Lys Leu Lys
1 5 10 15

Leu Tyr Ser Glu Ile Glu Leu Lys Pro Phe Phe Ile Ser Lys Asn Gly
20 25 30

Ser Leu Phe Asn Val Asp Ala Ile Asp Phe Leu Arg Lys Leu Glu Ser
35 40 45

Asn Ser Val Asp Leu Ile Phe Ala Asp Pro Pro Tyr Asn Ile Lys Lys
50 55 60

Ala Glu Trp Asp Ile Phe Ser Ser Gln Asn Glu Tyr Leu Glu Trp Ser
65 70 75 80

Lys Glu Trp Ile Met Glu Ala His Arg Val Leu Lys Asp Asn Gly Ser
85 90 95

Leu Tyr Val Cys Gly Phe Ser Glu Ile Leu Ala Asp Ile Lys Phe Ile
100 105 110

Thr Ser Lys Tyr Phe His Ser Cys Lys Trp Leu Ile Trp Phe Tyr Arg
115 120 125

Asn Lys Ala Asn Leu Gly Lys Asp Trp Gly Arg Ser His Glu Ser Ile
130 135 140

Leu Leu Leu Arg Lys Ser Lys Asn Phe Ile Phe Asn Ile Asp Glu Ala
145 150 155 160

Arg Ile Pro Tyr Asn Glu His Thr Val Lys Tyr Pro Gln Arg Thr Gln
165 170 175

Ala Glu Ser Ser Gln Tyr Ser Asn Ser Lys Lys Gln Tyr Ile Trp Glu
180 185 190

Pro Asn Pro Leu Gly Ala Lys Pro Lys Asp Val Leu Glu Ile Pro Thr
195 200 205

Ile Ser Asn Gly Ser Trp Glu Arg Ser Ile His Pro Thr Gln Lys Pro
210 215 220

Val Glu Leu Leu Lys Lys Ile Ile Leu Ser Ser Ser Asn Lys Asp Ser
225 230 235 240

Leu Ile Leu Asp Pro Phe Gly Gly Ser Gly Thr Thr Tyr Ala Val Ala
245 250 255

Glu Ala Phe Gly Arg Lys Trp Ile Gly Thr Glu Leu Asp Lys Asn Tyr
260 265 270

Cys Leu Glu Ile Gln Lys Arg Leu Lys Asp Glu Ser Met Ile Asn Arg
275 280 285

Ile Phe Ser Gly Asp Asp Asp Ser Asn Ser Gln Asn Arg Arg Lys Lys
290 295 300

Leu Arg Gly Glu
305

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 924 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GTGAATGAAA TAGCGTTTGA TAATTACAGT TATATACCAA AATTAAAACT TTATTCGGAA 60 

ATCGAGCTTA AACCATTTTT TATTTCAAAA AACGGTTCAC TTTTCAATGT TGATGCTATT 120

GATTTTTTAA GAAAATTAGA GAGTAATTCT GTGGATTTAA TTTTTGCAGA TCCACCTTAT 180 

AACATTAAAA AGGCAGAGTG GGATATTTTT TCTTCTCAAA ATGAATATCT CGAATGGAGT 240 

AAAGAATGGA TAATGGAAGC TCATAGAGTT TTAAAAGATA ATGGCAGTTT ATATGTTTGT 300 

GGCTTTTCAG AAATTCTGGC AGACATAAAA TTTATCACTT CAAAATATTT TCACAGTTGT 360 

AAATGGTTGA TTTGGTTCTA TAGAAACAAG GCAAATTTAG GTAAAGATTG GGGACGTTCA 420

CACGAAAGTA TACTGTTATT AAGAAAATCT AAAAATTTTA TTTTTAATAT TGATGAGGCA 480

CGAATCCCGT ATAATGAGCA TACAGTTAAA TATCCACAAA GAACCCAGGC CGAATCTTCG 540

CAATATTCGA ACTCAAAAAA GCAATATATT TGGGAGCCAA ACCCATTAGG AGCTAAGCCA 600 

AAAGATGTTT TGGAGATTCC CACAATTTCA AATGGTTCTT GGGAAAGAAG TATTCACCCT 660 

ACGCAAAAGC CAGTAGAATT GCTTAAAAAA ATAATTTTAT CTTCATCTAA TAAAGATAGT 720 

TTAATTCTTG ATCCATTTGG TGGTTCGGGA ACTACATATG CTGTTGCGGA AGCTTTTGGC 780 

AGAAAATGGA TTGGAACAGA GTTAGATAAA AATTATTGTC TGGAAATTCA AAAGCGATTG 840

AAAGACGAAA GTATGATCAA CAGGATTTTT TCAGGCGATG ATGATTCAAA TTCTCAAAAT 900 

AGAAGAAAAA AATTAAGAGG AGAA 924
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "synthetic DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
TCGAGATTTA GGTGACACTA TAGAATACA 29



(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
AGCTTGTATT CTATAGTGTC ACCTAAATC

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TCGAGATTTA GGTGACACTA TAGAATACGG AATTGTGAGC GGATAACAAT TCCA 54

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
AGCTTGGAAT TGTTATCCGC TCACAATTCC GTATTCTATA GTGTCACCTA AATC 54

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Leu Pro Leu Asp Lys Asp Leu Gln Lys Ala Lys Ile Ser Ile Thr
1 5 10 15

Asp Phe Phe Glu 
20 

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: primer_bind 

(B) LOCATION: 1..23

(D) OTHER INFORMATION: /note= "6, 9, 12 = inosine" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
ATGTTNCCNY TNGAYAARGA YYT 23

(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE:

(A) NAME/KEY: primer_bind

(B) LOCATION: 1..23

(D) OTHER INFORMATION: /note= "9 = inosine"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
AAGGATTTNC ARAARGCNAA RAT 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
TAAATCTAAA CCAATTCCAA TAGATTGCTG 
(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TAAATCTAAA CCAATTCCAA TAGATTGCTG 
(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:
GAACTGTATT ACAAATTCCA TATGAAACTG

(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:
GACAGAAAAT TTACAAATGC GAGAATGACC 30 
(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1588 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

CCATGGCACA CGTTTCAAAA AAGAAATCCT CGAAGTCAAA TATGATGAGA AAAACATCTC 60

AGACATCCTG CATATGACGG TGGATGAAGC ATTGGAATTT TTCTCGGAAA ATCACGAAGA 120

AAAAATTGTA ACCAAACTAA AACCTTTGCA GGACGTTGGT TTGGGTTATC TTCAGTTAGG 180

CCAGTCCTCC TCTACTCTTT CCGGCGGTGA AGCCCAAAGA GTGAAGCTCG CCTCTTTCCT 240

TGTGAAAGGT GTAACGACGG AAAAAACGTT ATTTGTTTTT GATGAACCAT CAACAGGATT 300

ACATTTCCAC GACATTCAAA AATTACTGAA ATCACTTCAG GCACTGATAG AATTAGGGCA 360

TTCGGTTGTA GTGATTGAGC ATCAGCCGGA TATTATCAAA TGCGCCGATT ACATCATCGA 420

TGTCGGACCC AATGCCGGAA AATACGGTGG CGAAATTGTT TTCACAGGAA CTCCGGAAGA 480

TTTGGTAAAA GAGAAAAAGT CGTTTACAGG GAAGTATATT AAGGAGAAGT TAAAGTAATT 540

TATTTATATT TGAAGTTATG CTACCACTGG ATAAAGATTT ACAAAAAGCA AAGATTTCAA 600

TTACTGATTT TTTTGAAATT ACAAATAGAG TTTTAGATTA TTTCCCCAAT GTAATCAATA 660

ATACAGTTGA AAAAGGAGAT TATTTAATAT CCTCATCAAA TATTGCTGGA ACAATAAAAT 720

TCCTAAGACC AATCAATAGA AAGTTATTTA TTCAGGAAAA AAAAGTTTTC AATGATTATT 780

TTCAAAAACT GATTATAGTT TTTGAAAATA TAAGGAACAA AAAAACTGTA ACAGAGGAAG 840

ATAAAATTAT TATTGATAGG GTAATTTACA CAATACAGCA ATCTATTGGA ATTGGTTTAG 900

ATTTAATGGT TAATCAAAAT AGTGCTAGAA AGCACGTTGG TAACCGATTT GAAGAATTAA 960

TTAGAGTCAT TTTTACAGAA ATATCAGTAT CGAATAAAAG AACTGTATTA CAAATTCCAT 1020

ATGAAACTGA TGAAGGACAG AAAATTTACA AATGCGAGAA TGACCTCATT ATTTCTCCTT 1080

TTGAAAATGT AGAATCTACA AACAAACATC TAGATGAAAA TGAGATTGTT GTTTCAATAA 1140

AGACAACATC AAAAGATAGG ATGGGAAAAA TGTTTATAGA TAAAATTTTA CTTGAAAGGT 1200

TTGTTAAACA CCCTCAAAAA GTTATAGGGA TTTTCCTCAA TGATGTACAA AGAAAAGAAG 1260

ACAACAATAT CAGCTTTACA CTTGTTTCAG GATTATTTAT GGTGTATACT AAATTCTTAA 1320

CTACTCTTGA AGGGATCTAT TATTTAGATC CACCACCTAA TGCATTGAAA CTACCATATT 1380

CTAATCATAT GAAAAGATTT TCAGATTTAA TTACAGAAGA CCTTGAAAAA TTATTCTCCT 1440

CTTAATTTTT TTCTTCTATT TTGAGAATTT GAATCATCAT CGCCTGAAAA AATCCTGTTG 1500

ATCATACTTT CGTCTTTCAA TCGCTTTTGA ATTTCCAGAC AATAATTTTT ATCTAACTCT 1560

GTTCCAATCC ATTTTCTGCC AAAAGCTT 1588
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
ATATTTGAAG CCATGGTACC ACTGG 25

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
GATGATTCAA ATTCTCACCA TGGAAG 26

(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

AGATCTGGTC ATCCCAAACA AAAATCTTTC GGTTTACGAA GATGCAGTCG CTTCCTGGAA 60 

AGGCGAAAGT ATGAGCGAAT GGAAAAAGGA ATTCATCAAA AAAGCCAAAG ATTTCCCAAT 120

TCACAAGCCT TATCATCAAC TCACAAAAGA GCAGAAACAG TTCCTTTGGA AAGGCGATAA 180

AACCAGAAGT TTCCCAAGTA TTGATAATTT TTTCAAAATG CTTGAAGAGA ATCTTTACAA 240

GATCCAATAC CGCGTAATGC TTTCGCGCTA TCGTGGGAAA ACACTTTGCC CCGATTGCGA 300 

AGGATTACGA TTGCGGGAAG AAACAAGCTG GGTGAAGATT GACGGACACA ACATTCAGTC 360

TTTGATTGAA TTACCTTTGG ATGAACTCCT GCCATTGATC AAAAGCTTAA AACTGAACGT 420 

CCACGACAGA GAAATTGCCA AACGCCTGAC TTACGAAATC GAAACGAGAT TAGAATTCCT 480

GACGAAAGTC GGCCTTGGAT ATCTGACTTT GAACCGAACA TCCAACACGC TTTCCGGAGG 540

AGAAAGCCAG AGAATCAATC TGGCGACAGC TTGGGAAGTT CGCTGGTTGG TTCTATTTAT 600

ATTTTGGATG AGCCGAGCAT TGGTCTGCAT TCCCGCGATA CAGAAAATCT GATTGGTGTC 660

CTCAAACAAC TCCGCGATTT GGGAANTACC GTGATTGTTG TAGAACACGA CGAAGATGTG 720

ATGCTTGCGG CAGNTTACAT TATAGATATT GGCCCNGNAG CGGGCTACCT TGGTGGCGAT 780

CTTGTTTTCA GCGNGGATTA TAAAGAGATG CTGAAGTNTN ATACTTTAAC CGCAAAATAC 840

CTGAATGGCG AACTGAAAAT AGAAGTTCCT GAAAAACGAA GAAAACCGAA GGAATTCATC 900

GCAATAAAAG GTGCCCGCCA GAATAATTTA AAAAATATTG ACGTTGATGT TCCGTTAGAA 960 

TGTCTGACAG TTATCACAGG CGTTTCTGGA AGCGGGAAAT CCACTTTGAT GAAGGAAGTG 1020 

ATGACCAATG CCATCCAGAT CCAACTGGGA ATGGGCGGCA AAAAAGCCGA TTACGATTCG 1080

GTGGAATTCC CGAAAAAGCT GATCCAGAAT ATCGAACTGA TTGACCAGAA CCCAATCGGG 1140

AAATCGTCCC GCTCCAACCC CGTGACTTAT CTGAAAGCTT ACGACGATAT CCGGGATCTT 1200 

TTTGCGAAAC AAAAATCCGC AAAAATCCAG GGTTACAAAC CGAAGCATTT CTCCTTCAAT 1260 

GTGGATGGCG GAAGATGTGA CGAGTGCAAA GGCGAAGGTA TCATTACCGT ATCAATGCAG 1320

TTTATGGCGG ACATCGAGCT GGAGTGTGAG CATTGCCATG GCACACGTTT CAAAAAAGAA 1380

ATCCTCGAAG TCAAATATGA TGAGAAAAAC ATCTCAGACA TCCTGCATAT GACGGTGGAT 1440

GAAGCATTGG AATTTTTCTC GGAAAATCAC GAAGAAAAAA TTGTAACCAA ACTAAAACCT 1500

TTGCAGGACG TTGGTTTGGG TTATCTTCAG TTAGGCCAGT CCTCCTCTAC TCTTTCCGGC 1560 

GGTGAAGCCC AAAGAGTGAA GCTCGCCTCT TTCCTTGTGA AAGGTGTAAC GACGGAAAAA 1620 

ACGTTATTTG TTTTTGATGA ACCATCAACA GGATTACATT TCCACGACAT TCAAAAATTA 1680

CTGAAATCAC TTCAGGCACT GATAGAATTA GGGCATTCGG TTGTAGTGAT TGAGCATCAG 1740

CCGGATATTA TCAAATGCGC CGATTACATC ATCGATGTCG GACCCAATGC CGGAAAATAC 1800

GGTGGCGAAA TTGTTTTCAC AGGAACTCCG GAAGATTTGG TAAAAGAGAA AAAGTCGTTT 1860

ACAGGGAAGT ATATTAAGGA GAAGTTAAAG TAATTTATTT ATATTTGAAG TTATGCTACC 1920

ACTGGATAAA GATTTACAAA AAGCAAAGAT TTCAATTACT GATTTTTTTG AAATTACAAA 1980

TAGAGTTTTA GATTATTTCC CCAATGTAAT CAATAATACA GTTGAAAAAG GAGATTATTT 2040

AATATCCTCA TCAAATATTG CTGGAACAAT AAAATTCCTA AGACCAATCA ATAGAAAGTT 2100

ATTTATTCAG GAAAAAAAAG TTTTCAATGA TTATTTTCAA AAACTGATTA TAGTTTTTGA 2160 

AAATATAAGG AACAAAAAAA CTGTAACAGA GGAAGATAAA ATTATTATTG ATAGGGTAAT 2220 

TTACACAATA CAGCAATCTA TTGGAATTGG TTTAGATTTA ATGGTTAATC AAAATAGTGC 2280

TAGAAAGCAC GTTGGTAACC GATTTGAAGA ATTAATTAGA GTCATTTTTA CAGAAATATC 2340

AGTATCGAAT AAAAGAACTG TATTACAAAT TCCATATGAA ACTGATGAAG GACAGAAAAT 2400

TTACAAATGC GAGAATGACC TCATTATTTC TCCTTTTGAA AATGTAGAAT CTACAAACAA 2460

ACATCTAGAT GAAAATGAGA TTGTTGTTTC AATAAAGACA ACATCAAAAG ATAGGATGGG 2520

AAAAATGTTT ATAGATAAAA TTTTACTTGA AAGGTTTGTT AAACACCCTC AAAAAGTTAT 2580

AGGGATTTTC CTCAATGATG TACAAAGAAA AGAAGACAAC AATATCAGCT TTACACTTGT 2640

TTCAGGATTA TTTATGGTGT ATACTAAATT CTTAACTACT CTTGAAGGGA TCTATTATTT 2700

AGATCCACCA CCTAATGCAT TGAAACTACC ATATTCTAAT CATATGAAAA GATTTTCAGA 2760

TTTAATTACA GAAGACCTTG AAAAATTATT CTCCTCTTAA TTTTTTTCTT CTATTTTGAG 2820 

AATTTGAATC ATCATCGCCT GAAAAAATCC TGTTGATCAT ACTTTCGTCT TTCAATCGCT 2880 

TTTGAATTTC CAGACAATAA TTTTTATCTA ACTCTGTTCC AATCCATTTT CTGCCAAAAG 2940

CTTCCGCAAC AGCATATGTA GTTCCCGAAC CACCAAATGG ATCAAGAATT AAACTATCTT 3000

TATTAGATGA AGATAAAATT ATTTTTTTAA GCAATTCTAC TGGCTTTTGC GTAGGGTGAA 3060

TACTTCTTTC CCAAGAACCA TTTGAAATTG TGGGAATCTC CAAAACATCT TTTGGCTTAG 3120

CTCCTAATGG GTTTGGCTCC CAAATATATT GCTTTTTTGA GTTCGAATAT TGCGAAGATT 3180

CGGCCTGGGT TCTTTGTGGA TATTTAACTG TATGCTCATT ATACGGGATT CGTGCCTCAT 3240

CAATATTAAA AATAAAATTT TTAGATTTTC TTAATAACAG TATACTTTCG TGTGAACGTC 3300 

CCCAATCTTT ACCTAAATTT GCCTTGTTTC TATAGAACCA AATCAACCAT TTACAACTGT 3360 

GAAAATATTT TGAAGTGATA AATTTTATGT CTGCCAGAAT TTCTGAAAAG CCACAAACAT 3420

ATAAACTGCC ATTATCTTTT AAAACTCTAT GAGCTTCCAT TATCCATTCT TTACTCCATT 3480

CGAGATATTC ATTTTGAGAA GAAAAAATAT CCCACTCTGC CTTTTTAATG TTATAAGGTG 3540

GATCTGCAAA AATTAAATCC ACAGAATTAC TCTCTAATTT TCTTAAAAAA TCAATAGCAT 3600 

CAACATTGAA AAGTGAACCG TTTTTTGAAA TAAAAAATGG TTTAAGCTCG ATTTCCGAAT 3660

AAAGTTTTAA TTTTGGTATA TAACTGTAAT TATCAAACGC TATTTCATTC ACAAATGAAT 3720

CAATCTGCTG TTGTGTATAA ACCCTGTAAT TATTAATAGG ATGTCTTAAA CTTTTGAATT 3780

TTCCAGAATT ATCCCATCTT CCTTAATGTC TCAGAGTTAA CATCTAATAA TTTCGCCGCT 3840

TCTTTTATTG ATAAATAATC ATCCATATCT TACACAACAT TACACAAGTT TATACAGCAA 3900

ATATAAATAT TTTTTATACA TTGTAAAAAT TTTATTTACT TTTATTTTGT TCAATTGTCT 3960

CAATAAATAG TTAATCGAAA TACATTTTGA ATATGATAAA ATTGACTCCA ACAAATCTAA 4020

CACAATGACA TTAAAACCAA TTAAAACGGA AGAAGATTAC AATCAGGTTT TAGAAAGACT 4080

TTCACAAATT TTCGACGCTA AACCAAATAC CAAAGATGGA GATGAATTGG GAAATCTTGG 4140

GAATTC 4146
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:
ATTTAGGTGA CACTATAGAA TAG 23

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
AAGCTCGAGT CTGATGACGA AGCTTGACTG ACTGAGATCA GCTTGCAAC 49

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CTCGAGATTT AGGTGACACT ATAGAATACA AGCTT 35

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CTCGAGATTT AGGTGACACT ATAGAATACG GAATTGTGAG CGGATAACAA TTCCAAGCTT 60